Second Language Information Transfer in Automatic Verb Classification – A
نویسندگان
چکیده
Second Language Information Transfer in Automatic Verb Classification – A Preliminary Investigation Vivian Tsang Master of Science Graduate Department of Computer Science University of Toronto 2001 Lexical semantic classes incorporate both syntactic and semantic information about verbs. Lexical semantic classification of verbs provide a great deal of useful information about the possible usage of each verb. In our work, we explore the use of multilingual corpora in the automatic learning of verb classification. We extend the work of Merlo and Stevenson (2001a), in which statistics on simple syntactic features extracted from textual corpora were used to train an automatic classifier for three lexical semantic classes of English verbs. We hypothesize that some lexical semantic features which are difficult to detect superficially in English may manifest themselves syntactically in another language. In our two-way classification task, features from multiple languages achieve an accuracy as high as 81%, making a small bitext a useful alternative to using a large monolingual corpus for verb classification. In this thesis, experimental results are presented and future extensions are discussed.
منابع مشابه
Hybrid Methods for Acquisition of Lexical Information: the Case for Verbs
Improved automatic text understanding requires detailed linguistic information about the words that comprise the text. Particularly crucial is the knowledge about predicates, typically verbs, which communicate both the event being expressed and how participants are related to the event. Although the field of natural language processing (NLP) has yet to develop a clear consensus on guidelines fo...
متن کاملCross-Lingual Induction and Transfer of Verb Classes Based on Word Vector Space Specialisation
Existing approaches to automatic VerbNetstyle verb classification are heavily dependent on feature engineering and therefore limited to languages with mature NLP pipelines. In this work, we propose a novel cross-lingual transfer method for inducing VerbNets for multiple languages. To the best of our knowledge, this is the first study which demonstrates how the architectures for learning word em...
متن کاملWhich are the Best Features for Automatic Verb Classification: a Corpus Study of Levin Verb Classification
Much research in lexical acquisition has concentrated on the hypothesis that the behavior of a verb, particularly with respect to the expression of arguments and the assignment of semantic roles is to a large extent driven by deep semantic regularities. We focus on the empirical basis for a central part of this research, Levin (1993), (which we abbreviate as Levin from now on). The core of Levi...
متن کاملA Multilingual Paradigm for Automatic Verb Classi cationPaola
We demonstrate the beneets of a multilingual approach to automatic lexical semantic verb classiication based on statistical analysis of corpora in multiple languages. Our research incorporates two interrelated threads. In one, we exploit the similarities in the crosslinguis-tic classiication of verbs, to extend work on English verb classiication to a new language (Italian), and to new classes w...
متن کاملTask-Specific Artifacts of Parametric Properties in English as a Second Language Acquired by Persian-Speaking Learners
This experimental study investigated the learners’ integrative acquisition of obligatory overt subjects and subject-verb clause agreement in English as an L2. In L1 acquisition research, correlations between superficially unrelated linguistic phenomena are analyzed in terms of integrative effects. For instance, in English L1 acquisition, there is evidence for an integrative appearance of subjec...
متن کامل